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Abstract 

In this paper, we make a preliminary interpretation of Cook’s theorem presented in [1]. This interpretation 
reveals cognitive biases in the proof of Cook’s theorem that arise from the attempt of constructing a 
formula in CNF to represent a computation of a nondeterministic Turing machine. Such cognitive biases 
are due to the lack of understanding about the essence of nondeterminism , and lead to the confusion 
between different levels of nondeterminism and determinism , thus cause the loss of nondeterminism 
from the TVP-completeness theory. The work shows that Cook’s theorem is the origin of the loss of 
nondeterminism in terms of the equivalence of the two definitions of NP, the one defining NP as the 
class of problems solvable by a nondeterministic Turing machine in polynomial time, and the other 
defining iVP as the class of problems verifiable by a deterministic Turing machine in polynomial time. 
Therefore, we argue that fundamental difficulties in understanding P versus NP lie firstly at cognition 
level, then logic level. 
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1 Introduction 

The notion of nondeterminism is lost from the current definition of NP, which is reflected in 
the equivalence of the two definitions of NP commonly accepted in the academic community 
[2] [3] [4] [5], the one is the solvability-based definition that defines NP as the class of problems solv¬ 
able by a nondeterministic Turing machine in polynomial time, and the other is the verifiability- 
based definition that defines NP as the class of problems verifiable by a deterministic Turing 
machine in polynomial time. Due to this equivalence, the verifiability-based definition has been 
accepted as the standard definition of NP, which has led to the disappearance of nondeterminism 
from NP, and caused ambiguities in understanding NP, thus P versus NP [6]. 

In the paper entitled What is NP ? - Interpretation of a Chinese paradox: White horse is not 
horse [7], we questioned this equivalence. With the help of a famous Chinese paradox White 

Email address: yu.li@u-picardie.fr (JianMing ZHOU, Yu LI). 


1 



horse is not horse , we interpreted some well-known arguments supporting this equivalence, and 
revealed cognitive biases that cause the confusion between different levels of nondeterminism and 
determinism from the view of recognition of problem. 

In this paper, we make a preliminary interpretation of Cook’s theorem presented in [1] from 
the view of representation of problem, and reveal cognitive biases in the proof of Cook’s theorem. 

The paper is organized as follows. In Section 2, we present an overview of Cook’s theorem. 
In Section 3, we interpret Cook’s theorem based on query machine. In Section 4, we interpret 
Cook’s theorem based on NDTM model. In Section 5, we propose to rectify Cook’s theorem, and 
in Section 6 we conclude the paper. 


2 Overview of Cook’s theorem 

Cook’s theorem is usually stated as [8]: 

Any problem in NP can be reduced in polynomial time by a deterministic Turing machine to 
the problem of determining whether a formula in CNF is satisfiable (SAT). 

However, the original statement of Cook’s theorem was presented in Cook’s paper entitled The 
complexity of theorem proving procedures as [1]: 

Theorem 1 If a set S of strings is accepted by some nondeterministic Turing machine within 
polynomial time, then S is P-reducible to {DNF tautologies}. 

The main idea of the proof of Theorem 1 was described in [1] : 

Suppose a nondeterministic Turing machine M accepts a set S of strings within time Q(n), 
where Q{n) is a polynomial. Given an input w for M, we will construct a propositional formula 
A(w) in conjunctive normal form (CNF) such that A(w ) is satisfiable iff M accepts w. Thus 
-i A(w) is easily put in disjunctive normal form (using De Morgans laws), and ->A(w) is a tau¬ 
tology if and only if w (f S. Since the whole construction can be carried out in time bounded by a 
polynomial in \ w (the length ofw), the theorem will be proved. 

Here S refers to a set of all instances of an NP problem that have solutions, and finding the 
tautology of -> A(w) in DNF is transformed into finding the satisfiability of A(w) in CNF. 

Concerning P-reducibility, it was explained in [1]: 

Here ’’reduced” means, roughly specking, that the first problem can be solved deterministically 
in polynomial time provided an oracle is available for solving the second. 

That is, Cook attempted to construct a formula A(w) in CNF to represent a computation of a 
nondeterministic Turing machine in order to achieve the objective of representing an NP problem 
as SAT problem, however the construction of A{w) is a deterministic and polynomial time process, 
that is, a computation of a deterministic Turing machine. Therefore, how to construct A(w) 
constitutes the proof of Theorem 1. 

We interpret this proof and reveal cognitive biases hidden in it. 
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3 Interpretation of Cook’s theorem based on Query Ma¬ 
chine 

3.1 Query Machine and P-reduciblility 

In order to provide a reasonable basis for P-reduciblility in Theorem 1, Cook introduced a tool 
called query machine , which is a mix of an oracle and a deterministic Turing machine, to replace 
a nondeterministic Turing machine. It is this query machine that sows the seeds of confusion of 
NDTM (NonDeterministic Turing Machine) and DTM (Deterministic Turing Machine) in the 
proof of Theorem 1. 

Let us analyze this query machine. A query machine was defined in [1]: 

A query machine is a multitape Turing machine with a distinguished tape called the query tape, 
and three distinguished states called the query state, yes state, and no state, respectively. If M 
is a query machine and T is a set of strings, then a T-computation of M is a computation of 
M in which initially M is in the initial state and has an input string w on its input tape, and 
each time M assures the query state there is a string u on the query tape, and the next state M 
assumes is the yes state if u eT and the no state if u jLT. We think of an ’oracle’, which knows 
T, placing M in the yes state or no state. 

Then the concept of P-reduciblility was defined based on query machine [1]: 

Definition. A set S of strings is P-reducible (P for polynomial) to a set T of strings iff there 
is some query machine M and a polynomial Q(n) such that for each input string w, the T- 
computation of M with input w halts within Q(\ w \) steps (\ w | is the length of w) and ends in 
an accepting state iff w E S. 

It is not hard to see that P-reducibility is a transitive relation. Thus the relation E on sets of 
strings, given by ( S , T) E E iff each of S and T is P-reducible to the other, is an equivalence 
relation. The equivalence class containing a set S will be denoted by deg (S) (the polynomial degree 
of difficulty of S). 

In addition, five NP problems were given as examples to illustrate S and T [1]: 

We now define the following special sets of strings. 

1. The subgraph problem is the problem given two finite undirected graphs, determine whether 
the first is isomorphic to a subgraph of the second. A graph G can be represented by a string 
G on the alphabet {0,1,*} by listing the successive rows of its adjacency matrix, separated 
by *s. We let subgraph pairs denote the set of strings G\ * *G *2 such that G\ is isomorphic 
to a subgraph 0 /G 2 . 

2. The graph isomorphism problem will be represented by the set, denoted by {isomorphic graph 
pairs}, of all strings G\ * *(^2 such that G 1 is isomorphic to G 2 . 

3. The set {Primes} is the set of all binary notations for prime numbers. 

4. The set {DNF tautologies} is the set of strings representing tautologies in disjunctive normal 
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form. 

5. The set D3 consists of those tautologies in disjunctive normal form in which each disjunct 
has at most three conjuncts (each of which is an atom or negation of an atom). 


Query Machine Deterministic Turing Machine 



(a) (b) 


Fig. 1: (a) A query machine accepts w\ (b) A deterministic Turing machine accepts w 

We interpret how a query machine M accepts an instance w of an NP problem in polynomial 
time, which is illustrated in Fig. 1(a). 

S refers to a set of strings that represents all instances that have solutions, for example, S refers 
to a set of instances G\ * *G 2 of the graph isomorphism problem such that G\ is isomorphic to 
G 2 ■ T refers to a set of formulas in DNF that are tautologies. 

Initially, M is in the initial state qo and has w representing an instance of a NP problem as 
input. Then, M assures the query state qQ ue ry where there is a string u representing a formula in 
DNF as input for an oracle and this oracle instantly determines whether u G T, that is, whether 
u is tautology. Finally, according to the obtained reply, if u G T then the oracle places M in the 
yes state qy and accepts w; or if a f T then the oracle places M in the no state <y v and refuses 
w. 

In this way, S is said to be P-reducible to T. 

Therefore, a query machine is in fact a formalized oracle. But it should pay special attention 
to the essence of oracle. The existence of an oracle is just a hypothesis rather than a fact, so 
it is just theoretically valid, but not in practice, while a deterministic Turing machine is either 
theoretically or practically valid. That is, oracle and DTM are two concepts situated at different 
levels (see Fig. 1). 

Unfortunately, it seems that Cook did not realize this fundamental difference, when he inter¬ 
preted P-reducibility in [1]: 

By reduced we mean, roughly speaking, that if tautology hood could be decided instantly (by an 
’’oracle”) then these problems could be decided in polynomial time. In order to make this notion 
precise, we introduce query machines, which are like Turing machines with oracles in [1]. 

In other words, Cook made a direct logic deduction between oracle and DTM, but this is a 
question begging, because the existence of an oracle itself needs to be proved. It is this cognitive 
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bias that leads to the confusion of query machine and DTM, thus the confusion of NDTM and 
DTM in the proof of Theorem 1. 

3.2 Proof of Theorem 1 

The proof of Theorem 1 consists in constructing u from w, where w is an instance of an NP 
problem and u is a formula A(w) in CNF (see Fig. 1(a)), through representing a computation 
of a nondeterministic Turing machine for w. 

This idea was explained in [1]: 

Suppose a nondeterministic Turing machine M accepts a set S of strings within time Q(n), 
where Q(n ) is a polynomial. Given an input w for M, we will construct a propositional formula 
A(w ) in conjunctive normal form (CNF) such that A(w) is satisfiable iff M accepts w. 

However, the construction of A(w) is a deterministic and polynomial time process, so Cook 
discretely replaced a nondeterministic Turing machine by a deterministic Turing machine through 
a query machine (see Fig. 1). 

We interpret how it could happen. 

Originally the above M refers to a nondeterministic Turing machine, but this M has discretely 
changed in the following [1]: 

We may as well assume the Turing machine has only on tape, which is infinite to the right but 
has a left-most square. Let us number the squares from left to right 1,2,... Let us fix an input w 
to M of length n, and suppose w G S. Then there is a computation of M with input w that ends 
in an accepting state within T = Q(n) steps. The formula A(w) will be built from many different 
proposition symbols, whose intended meaning, listed below, refer to such a computation. 

That is, A(w) is built to represent a computation of M. Let us interpret the meaning of this 
computation through the analysis of the construction of A(w) [1]: 

Suppose the tape alphabet for M is {a \,..., <q} and the set of states is {q \,..., q r }. Notice that 
since the computation has at most T = Q(n) steps, no tape square beyond T is scanned. 

Proposition symbols : 

• P l st for 1 < i < l, 1 < s, t < T. P( t is true iff tape square number s at step t contains the 
symbol cq. 

• Q\ for 1 < i < r, 1 < t <T. Q\ is true iff at step t the machine is in state qi. 

• S s> t for 1 < s, t < T) is true iff at step t square number s is scanned by the tape head. 

The formula A(w) is a conjunction BACADAEAFAGAHAI formed as follows. Notice 
A(w) is in conjunctive normal form. 

B will assert that at each step t, one and only one square is scanned. B is a conjunctive 
B\ A B 2 A ... A Bt, where B t asserts that at time t one and only one square is scanned : 

B t = (. Si t t V S 2 >t V, ... V S T ,t) A [ /\l<i<j<Q(n)(^ S i,t v ■ 
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For 1 < s < T and 1 < t < T, C s)t asserts that at square s and time t there is one and only 
one symbol. C is the conjunction of all the C s j- 

D asserts that for each t there is one and only one state. 

E asserts the initial conditions are satisfied : 

E = Qi A Si t i A P^i A P^i A ... A Pffi A P^ +11 ... A Pf i 

Where w — cr ^,..., <T; n ; qo is the initial state and o\ is the blank symbol. 

F, G, and H assert that for each time t the values of the P's, Q's and S's are updated properly. 
For example, G is the conjunction over all t, i, j of G\j, where Gjj asserts that if at time t the 
machine is in state qi scanning symbol <jj, then at time t + 1 the machine is in the state qu, where 
qk is the state given by the transition function for M. 

G{j = A T s=ihQt v ^ S s,t V V Qt+i)- 

Finally, the formula I asserts that the machine reaches an accepting state at some time. The 
machine M should be modified so that it continues to compute in some trivial fashion after reach¬ 
ing an accepting state, so that A(w ) will be satisfied. 

Firstly, let us clarify M. Notice that B, C , D and / refer to just the physical structure of 
M; F, G and H refer to the transition function of M; and E concerns the initial condition of a 
computation of M. 

Such a computation proceeds as below. Starting from E, the assignment of the proposition 
symbols corresponding to following steps are deduced out according to BACADAFAGAH, 
finally the truth of / is deduced out in polynomial time. Since this computation ends in poly¬ 
nomial time Q{n), and F, G and H represent the transition function of M in terms of G\ ■ = 

t\s=i (A Qt V ->S 8)t V -i Pf t V Q* + 1 ), then this computation is in fact a computation of a deterministic 
Turing machine M. In other words, the original nondeterministic Turing machine M has been 
discretely transformed into a deterministic Turing machine Ml 

Then, we interpret the meaning of such a computation by clarifying E. E refers to the assign¬ 
ment of the proposition symbols corresponding to the initial time t = 1, E = Q® A S \ : i A P[\ A 
P^i A ... A P'jj A Pn+i,i ■■■ A Pfi, where Q\ means that M is in the initial state q 0 , Si t i means 
that the tape head scans square number 1, Pf +l;1 ... A Pf A refers to the blank symbols cq after 
<Tj 1 ,..., <7j n , and A Pff A ... A P r *”i refers to a string ..., <Ji n on the tape that is claimed 

to refer to an instance w of a problem. 

That is to say, given an instance w of an NP problem, a deterministic Turing machine M de¬ 
termines whether to accept w in polynomial time, and A(w) is built to represent this computation 
(see Fig. 1(b)). 

In other words, the proof of Theorem 1 claims that a deterministic Turing machine can accept 
w like a nondeterministic Turing machine, that is, NDTM is confused with DTM by means of 
query machine ! 
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4 Interpretation of Cook’s theorem based on NDTM model 


Later researchers must have noticed something wrong in the above proof, since they completely 
abandoned the concept of query machine, and proposed a NDTM (NonDeterministic Turing 
Machine) model (see [5], p. 30): 

The NDTM model we will be using has exactly the same structure as a DTM (Deterministic 
Turing Machine), except that it is augmented with a guessing module having its own write-only 
head. 

A computation of such a machine takes place in two distinct stages (see [5], p. 30-31): 

The first stage is the ’’guessing” stage. Initially, the input string x is written in tape squares 
1 through \ x | (while all other squares are blank), the read-write head is scanning square 1, the 
the write-only head is scanning square -1, and the finite state control is ’’inactive”. The guessing 
module then directs the write-only head, one step at a time, either to write some symbol from T in 
the tape square being scanned and move one square to left, or to stop, at which point the guessing 
module becomes inactive and the finite state control is activated in state q 0 . The choice of whether 
to remain active, and, if so, which symbol from T to write, is made by the guessing module in a 
totally arbitrary manner. Thus the guessing module can write any string from T* before it halts 
and, indeed, need never halt. 

The ’’checking” stage begins when the finite state control is activated in state q 0 . From this 
point on, the computation proceeds solely under the direction of the NDTM program according 
to exactly the same rules as for a DTM. The guessing module and its write-only head are no 
longer involved, having fulfilled their role by writing the guessed string on the tape. Of course, 
the guessed string can (and usually will) be examined during the checking stage. The computation 
ceases when and if the finite state control enters one of the two halt states (either qy or qy) and 
is said to be an accepting computation if it halts in state qy. All other computations, halting or 
not, are classed together simply as non-accepting computations. 


Deterministic Turing Machine with Guessing Module Nondeterministic Turing Machine 

Output: 

s I Y/N Accept ln P ut: *1 

(q 0 ) ' °™ Kq/qJ) /non-accept {%) 


(a) (b) 


Input: x 


Guessing 

Module 


Output: 

Accept 

/refuse 

(q Y /q N ) 


Fig. 2: (a) A machine of the NDTM model accepts x ; (b) A nondeterministic Turing machine accepts x 


That is, for a given instance x of an NP problem, a guessing module finds a certificate s of 
solution, then s is checked by a deterministic Turing machine. If s is a solution, the computa¬ 
tion halts in state qy and it is said to be an accepting computation; if s is not a solution, the 
computation halts in state qy and it is said to be a non-accepting computation (see Fig. 2(a)). 
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However, this non-accepting computation is completely different from the refusing computation 
of a nondeterministic Turing machine (see Fig. 2(b)), because this non-accepting computation 
has no sense, that is, a deterministic Turing machine with a guessing module cannot determine 
whether w is accepted or refused in this case. In other words, the NDTM model is not NDTM. 

Unfortunately, it seems that these researchers did not really realize the fundamental difference 
between the NDTM model and NDTM, and still attempted to construct a formula A(w) in 
CNF to represent a computation of a nondeterministic Turing machine through a machine of 
the NDTM model. Consequently, the cognitive bases in Cook’s theorem have been hidden more 
deeply in terms of the equivalence of the solvability-based definition and the verifiability-based one 
(see [3] section 7.3). 


5 Rectification of Cook’s theorem 

In fact, a formula A(w) in CNF can only represent a computation of a deterministic Turing 
machine, specially a polynomial time verification for any problem in NP as well as in P (see Fig. 

3 ). 


Deterministic Turing Machine 



Output: Y/N 

(q Y /q N ) 


Fig. 3: A deterministic Turing machine verifies a certificate s of solution 

Let us rectify the proof of Theorem 1. 

At the initial time t = 1, a certificate s of solution for an instance w is put on the tape squares, 
then E refers to the assignment of the proposition symbols concerning the initial time t — 1. From 
E, the assignment of the proposition symbols concerning other times are deduced out according 
to BACADAFAGAH, and finally the truth of I is deduced out. If / = 1, it means that s is 
a solution for w; otherwise if / = 0, s is not a solution for w. 

Therefore, E consists of two parts : E = E - L A E 2 , where E- L = Q® A and E 2 = P{\ A P 2 l A 
... A PfC A Pn+i,i • • • A Pf i- Ei refers to the initial state qo of M as well as the tape head scanning 
square number 1, while E 2 refers to a given certificate s. 

Here, it should pay special attention to E 2 . Since E 2 represents the assignment of the proposi¬ 
tion symbols concerning s and E 2 = 1, then E 2 should not appear in A(w). Unfortunately, this 
key point has never been discussed in the literature [1] [5], so that it prevents from interpreting 
correctly the meaning of A(w). 

Therefore, BAC ADA EiAFAGAHAl refers to the verification about the truth of any 



certificate, not limited to a given certificate s. Just in this sense, BACADAEiAFAGAHAl 
becomes a function of w, and it can be denoted as A(w) = BACADAEiAFAGAHAL 

In other words, it is BACADAEiAFAGAHAl, rather than BACADAEAFAGAHAl, 
that is intended to represent an instance w of a problem in terms of A(w). 

Now, we can clarify the real meaning of A(w), denoted in Theorem 0: 

Theorem 0 Any problem verifiable by a deterministic Turing machine in polynomial time can 
be represented as A(w). 

Since an NP problem is polynomially verifiable according to the definition of NDTM, so an 
NP problem can be represented as A{w). Note that any P problem is polynomially verifiable, so 
it can be also represented as A(w) in the same way. 

Furthermore, determining the satisfiability of A(w) corresponds to determining the existence 
of solution for an NP problem, thus it deduces out Theorem 1’ in terms of the usual expression 
of Cook’s theorem: 

Theorem 1’ Any problem solvable by a nondeterministic Turing machine in polynomial time 
can be represented as a SAT problem. 

Theorem 1’ consists of the rectification of the original statement Theorem 1 of Cook’s 
theorem including its proof. Therefore, Theorem 1 is just a corollary of Theorem 0, and the 
relation between Theorem 0 and Theorem 1 is the cause-effect relationship. In other words, 
the two definitions of NP, the verifiability-based definition corresponding to Theorem 0 and the 
solvability-based one to Theorem 1, are not equivalent, because they are situated at different 
levels of concept and have the cause-effect relationship. 


6 Conclusion 


In this paper, we give preliminary interpretation of Cook’s theorem in [1] and reveal the cognitive 
biases in the proof of Cook’s theorem, which leads to the confusion between different levels of 
nondeterminism and determinism , thus the confusion of the solvability-based definition and the 
verifiability-based one of NP, finally causes the loss of nondeterminism from NP. 

The work argues again that the difficulty in understanding P versus NP lies at firstly cognition 
level, then logic level [9]. A similar opinion is also suggested in [10]. 

We hope that this work can evoke reflections from different angles about some fundamental 
problems in cognitive science, and contribute to the understanding of P versus NP. Furthermore, 
we hope that this work can help to understand the complementarity of Chinese thought and 
Western philosophy. 


9 



References 


[1] Stephen Cook, The complexity of theorem proving procedures. Proceedings of the Third Annual 
ACM Symposium on Theory of Computing, pp. 151-158 (1971). 

[2] http://en. wikipedia.org/wiki/NP_(complexity) 

[3] Michael Sipser, Introduction to the Theory of Computation, Second Edition. International Edition 
(2006). 

[4] Stephen Cook, The P versus NP Problem. Clay Mathematics Institute. 
http://www.claymath.org/millennium/P_vs_NP/pvsnp.pdf. 

[5] Michael R. Garey, David S. Johnson, Computers and Intractability: A Guide to the Theory of 
NP-Completeness. W. H. Freeman and company (1979). 

[6] William I. Gasarch, The P=?NP poll. SIGACT News Complexity Theory Column 74. 
http://www.cs.umd.edu//~gasarch/papers/poll2012.pdf. 

[7] Yu Li, What is NP? - Interpretation of a Chinese paradox white horse is not horse, (to appear) 

[8] http://en. wikipedia.org/wiki/Cook-Levin_theorem 

[9] JianMing Zhou, Computability vs. Nondeterministic and P vs. NP. 
http://arxiv.org/abs/1305.4029. 

[10] Scott Aaronson, Why Philosophers Should Care About Computational Complexity, Electronic 
Colloquium on Computational Complexity, Revision 2 of Report No. 108 (2011). 


10 



